Speaker adaptive voice source modeling with applications to speech coding and processing
Authors
Abstract
We discuss the use of low-dimensional physical models of the voice source for speech coding and processing applications. A class of waveform-adaptive dynamic glottal models and parameter identification procedures are illustrated. The model and the identification procedures are assessed by addressing signal transformations on recorded speech, achievable by fitting the model to the data, and then acting on the physically oriented parameters of the voice source. The class of models proposed provides in principle a tool for both the estimation of glottal source signals, and the encoding of the speech signal for transformation purposes. The application of this model to time stretching and to fundamental frequency control (pitch shifting) is also illustrated. The experiments show that copy synthesis is perceptually very similar to the target, and that time stretching and “pitch extrapolation” effects can be obtained by simple control strategies. © 2014 Elsevier Ltd. All rights reserved.
Similar resources
Designing a new non-parallel training method for voice conversion with better performance than parallel training
Introduction: Voice mimicking by computer has been one of the most challenging topics in speech processing in recent years. A voice conversion system has two sides. On one side is the source speaker, whose voice is modified to mimic the voice of the target speaker on the other side. Two methods of p...
Experiments in voice quality modification of natural speech signals: the spectral approach
Voice quality is currently a key issue in speech synthesis research. The lack of realistic intra-speaker voice quality variation is an important source of concern for concatenation-based synthesis methods. A challenging problem is to reproduce the voice quality changes that occur in natural speech when the vocal effort varies. A new method for voice quality modification is presented. I...
0 Voice Conversion
Voice conversion (VC) is an area of speech processing that deals with the conversion of the perceived speaker identity. In other words, the speech signal uttered by a first speaker, the source speaker, is modified to sound as if it was spoken by a second speaker, referred to as the target speaker. The most obvious use case for voice conversion is text-to-speech (TTS) synthesis where VC techniqu...
A Review of Glottal Waveform Analysis
Glottal inverse filtering is of potential use in a wide range of speech processing applications. Since the process of voice production is, to a first-order approximation, a source-filter process, obtaining the source and filter components provides a flexible representation of the speech signal for use in processing applications. In certain applications the desire for accurate inverse filterin...
Voice Conversion
Voice conversion (VC) is an area of speech processing that deals with the conversion of the perceived speaker identity. In other words, the speech signal uttered by a first speaker, the source speaker, is modified to sound as if it was spoken by a second speaker, referred to as the target speaker. The most obvious use case for voice conversion is text-to-speech (TTS) synthesis where VC techniqu...
Journal title:
- Computer Speech & Language
Volume 28, Issue
Pages -
Publication date 2014